Conference Proceedings
Taking Risks with Confidence
Rodger Benham, Ben Carterette, Alistair Moffat, J Shane Culpepper
Proceedings of the 24th Australasian Document Computing Symposium on - ADCS '19 | ACM Press | Published : 2019
Abstract
Risk-based evaluation is a failure analysis tool that can be combined with traditional effectiveness metrics to ensure that the improvements observed are consistent across topics when comparing systems. Here we explore the stability of confidence intervals in inference-based risk measurement, extending previous work to five different commonly used inference testing techniques. Using the Robust04 and TREC Core 2017 NYT corpora, we show that risk inferences using parametric methods appear to disagree with their non-parametric counterparts, warranting further investigation. Additionally, we explore how the number of topics being evaluated affects confidence interval stability, and find that mor..
View full abstractRelated Projects (1)
Grants
Awarded by Australian Research Council
Funding Acknowledgements
The first author was supported by an RMIT VCPS Scholarship. The third and fourth authors were supported by the Australian Research Council (project DP190101113). Christina Knudson assisted by locating some relevant material.